AITopics | Adelphi

In recent years, reinforcement learning (RL) systems with general goals beyond a cumulative sum of rewards have gained traction, such as in constrained problems, exploration, and acting upon prior experiences. In this paper, we consider policy optimization in Markov Decision Problems, where the objective is a general concave utility function of the state-action occupancy measure, which subsumes several of the aforementioned examples as special cases.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
(2 more...)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Topology Identification and Inference over Graphs

Mateos, Gonzalo, Shen, Yanning, Giannakis, Georgios B., Swami, Ananthram

arXiv.org Machine LearningDec-12-2025

Topology identification and inference of processes evolving over graphs arise in timely applications involving brain, transportation, financial, power, as well as social and information networks. This chapter provides an overview of graph topology identification and statistical inference methods for multidimensional relational data. Approaches for undirected links connecting graph nodes are outlined, going all the way from correlation metrics to covariance selection, and revealing ties with smooth signal priors. To account for directional (possibly causal) relations among nodal variables and address the limitations of linear time-invariant models in handling dynamic as well as nonlinear dependencies, a principled framework is surveyed to capture these complexities through judiciously selected kernels from a prescribed dictionary. Generalizations are also described via structural equations and vector autoregressions that can exploit attributes such as low rank, sparsity, acyclicity, and smoothness to model dynamic processes over possibly time-evolving topologies. It is argued that this approach supports both batch and online learning algorithms with convergence rate guarantees, is amenable to tensor (that is, multi-way array) formulations as well as decompositions that are well-suited for multidimensional network data, and can seamlessly leverage high-order statistical information.

graph, matrix, topology identification and inference, (9 more...)

arXiv.org Machine Learning

2512.10183

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Orange County > Irvine (0.14)
North America > United States > California > Monterey County > Pacific Grove (0.04)
(10 more...)

Genre: Research Report (1.00)

Industry:

Banking & Finance (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.46)
Telecommunications > Networks (0.34)
Information Technology > Networks (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

Evidential Deep Learning to Quantify Classification Uncertainty

Murat Sensoy, Lance Kaplan, Melih Kandemir

Neural Information Processing SystemsNov-20-2025, 19:06:39 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, prediction, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > Adelphi (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Germany (0.04)
Asia > Middle East > Republic of Türkiye (0.04)

Industry: Government > Military (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Error Correcting Output Codes Improve Probability Estimation and Adversarial Robustness of Deep Neural Networks

Gunjan Verma, Ananthram Swami

Neural Information Processing SystemsOct-3-2025, 18:13:12 GMT

From a scientific perspective, the existence of adversarial examples demonstrates that machine learning models that achieve superhuman performance on benign, "naturally occurring"

adversarial example, arxiv preprint arxiv, probability, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > Adelphi (0.04)
North America > Canada (0.04)

Industry:

Government > Military (0.47)
Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.83)

Add feedback

Decentralized Langevin Dynamics for Bayesian Learning

Neural Information Processing SystemsAug-16-2025, 01:06:56 GMT

The performance of the proposed algorithm is evaluated on a wide variety of machine learning tasks.

agent, algorithm, target distribution, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Oklahoma > Payne County > Stillwater (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
(3 more...)

Industry: Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.65)

Add feedback

Air-Ground Collaboration for Language-Specified Missions in Unknown Environments

Cladera, Fernando, Ravichandran, Zachary, Hughes, Jason, Murali, Varun, Nieto-Granda, Carlos, Hsieh, M. Ani, Pappas, George J., Taylor, Camillo J., Kumar, Vijay

arXiv.org Artificial IntelligenceMay-15-2025

As autonomous robotic systems become increasingly mature, users will want to specify missions at the level of intent rather than in low-level detail. Language is an expressive and intuitive medium for such mission specification. However, realizing language-guided robotic teams requires overcoming significant technical hurdles. Interpreting and realizing language-specified missions requires advanced semantic reasoning. Successful heterogeneous robots must effectively coordinate actions and share information across varying viewpoints. Additionally, communication between robots is typically intermittent, necessitating robust strategies that leverage communication opportunities to maintain coordination and achieve mission objectives. In this work, we present a first-of-its-kind system where an unmanned aerial vehicle (UAV) and an unmanned ground vehicle (UGV) are able to collaboratively accomplish missions specified in natural language while reacting to changes in specification on the fly. We leverage a Large Language Model (LLM)-enabled planner to reason over semantic-metric maps that are built online and opportunistically shared between an aerial and a ground robot. We consider task-driven navigation in urban and rural areas. Our system must infer mission-relevant semantics and actively acquire information via semantic mapping. In both ground and air-ground teaming experiments, we demonstrate our system on seven different natural-language specifications at up to kilometer-scale navigation.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2505.09108

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Europe > Italy (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.64)

Industry:

Government > Military (0.89)
Information Technology > Robotics & Automation (0.66)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
(2 more...)

Add feedback

Value of Information-based Deceptive Path Planning Under Adversarial Interventions

Suttle, Wesley A., Milzman, Jesse, Karabag, Mustafa O., Sadler, Brian M., Topcu, Ufuk

arXiv.org Artificial IntelligenceMar-31-2025

V alue of Information-based Deceptive Path Planning Under Adversarial Interventions Wesley A. Suttle, Jesse Milzman, Mustafa O. Karabag, Brian M. Sadler, Ufuk Topcu Abstract -- Existing methods for deceptive path planning (DPP) address the problem of designing paths that conceal their true goal from a passive, external observer . Such methods do not apply to problems where the observer has the ability to perform adversarial interventions to impede the path planning agent. In this paper, we propose a novel Markov decision process (MDP)-based model for the DPP problem under adversarial interventions and develop new value of information (V oI) objectives to guide the design of DPP policies. Using the V oI objectives we propose, path planning agents deceive the adversarial observer into choosing suboptimal interventions by selecting trajectories that are of low informational value to the observer . Leveraging connections to the linear programming theory for MDPs, we derive computationally efficient solution methods for synthesizing policies for performing DPP under adversarial interventions. In our experiments, we illustrate the effectiveness of the proposed solution method in achieving deceptiveness under adversarial interventions and demonstrate the superior performance of our approach to both existing DPP methods and conservative path planning approaches on illustrative gridworld problems. I NTRODUCTION Deceptive path planning (DPP) is the problem of designing a path that conceals its true objective from an outside observer. Several approaches to this problem have recently been developed, using model-based planning [1], [2], [3], [4] and model-free reinforcement learning [5], [6], [7], [8]. These methods make the strong assumption that the observer is passive and unable to affect the path planning agent's environment, however, significantly limiting their applicability.

artificial intelligence, intervention, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2503.24284

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)

Add feedback

Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm

Shek, Chak Lam, Bedi, Amrit Singh, Basak, Anjon, Novoseller, Ellen, Waytowich, Nick, Narayanan, Priya, Manocha, Dinesh, Tokekar, Pratap

arXiv.org Artificial IntelligenceMar-28-2025

In this work, we present a novel cooperative multi-agent reinforcement learning method called \textbf{Loc}ality based \textbf{Fac}torized \textbf{M}ulti-Agent \textbf{A}ctor-\textbf{C}ritic (Loc-FACMAC). Existing state-of-the-art algorithms, such as FACMAC, rely on global reward information, which may not accurately reflect the quality of individual robots' actions in decentralized systems. We integrate the concept of locality into critic learning, where strongly related robots form partitions during training. Robots within the same partition have a greater impact on each other, leading to more precise policy evaluation. Additionally, we construct a dependency graph to capture the relationships between robots, facilitating the partitioning process. This approach mitigates the curse of dimensionality and prevents robots from using irrelevant information. Our method improves existing algorithms by focusing on local rewards and leveraging partition-based learning to enhance training efficiency and performance. We evaluate the performance of Loc-FACMAC in three environments: Hallway, Multi-cartpole, and Bounded-Cooperative-Navigation. We explore the impact of partition sizes on the performance and compare the result with baseline MARL algorithms such as LOMAQ, FACMAC, and QMIX. The experiments reveal that, if the locality structure is defined properly, Loc-FACMAC outperforms these baseline algorithms up to 108\%, indicating that exploiting the locality structure in the actor-critic framework improves the MARL performance.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

arXiv.org Artificial Intelligence

2503.18816

Country: